Overview
Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 700 |
| Missing cells | 140 |
| Missing cells (%) | 1.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 82.2 KiB |
| Average record size in memory | 120.2 B |
Variable types
| Text | 1 |
|---|---|
| Numeric | 7 |
| Categorical | 6 |
| DateTime | 1 |
age has 35 (5.0%) missing values | Missing |
employment_type has 35 (5.0%) missing values | Missing |
annual_income has 35 (5.0%) missing values | Missing |
credit_score has 35 (5.0%) missing values | Missing |
customer_id has unique values | Unique |
loan_amount has unique values | Unique |
join_date has unique values | Unique |
repayment_history has 63 (9.0%) zeros | Zeros |
Reproduction
| Analysis started | 2026-02-20 08:03:50.204890 |
|---|---|
| Analysis finished | 2026-02-20 08:03:54.454896 |
| Duration | 4.25 seconds |
| Software version | ydata-profiling vv4.18.1 |
| Download configuration | config.json |
Variables
customer_id
Text
Unique
| Distinct | 700 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.6 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Unique
| Unique | 700 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | CUST1000 |
|---|---|
| 2nd row | CUST1001 |
| 3rd row | CUST1002 |
| 4th row | CUST1003 |
| 5th row | CUST1004 |
| Value | Count | Frequency (%) |
| cust1000 | 1 | 0.1% |
| cust1009 | 1 | 0.1% |
| cust1010 | 1 | 0.1% |
| cust1002 | 1 | 0.1% |
| cust1003 | 1 | 0.1% |
| cust1004 | 1 | 0.1% |
| cust1005 | 1 | 0.1% |
| cust1006 | 1 | 0.1% |
| cust1007 | 1 | 0.1% |
| cust1008 | 1 | 0.1% |
| Other values (690) | 690 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 940 | |
| C | 700 | |
| U | 700 | |
| S | 700 | |
| T | 700 | |
| 0 | 240 | 4.3% |
| 4 | 240 | 4.3% |
| 6 | 240 | 4.3% |
| 2 | 240 | 4.3% |
| 3 | 240 | 4.3% |
| Other values (4) | 660 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 5600 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 940 | |
| C | 700 | |
| U | 700 | |
| S | 700 | |
| T | 700 | |
| 0 | 240 | 4.3% |
| 4 | 240 | 4.3% |
| 6 | 240 | 4.3% |
| 2 | 240 | 4.3% |
| 3 | 240 | 4.3% |
| Other values (4) | 660 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 5600 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 940 | |
| C | 700 | |
| U | 700 | |
| S | 700 | |
| T | 700 | |
| 0 | 240 | 4.3% |
| 4 | 240 | 4.3% |
| 6 | 240 | 4.3% |
| 2 | 240 | 4.3% |
| 3 | 240 | 4.3% |
| Other values (4) | 660 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 5600 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 940 | |
| C | 700 | |
| U | 700 | |
| S | 700 | |
| T | 700 | |
| 0 | 240 | 4.3% |
| 4 | 240 | 4.3% |
| 6 | 240 | 4.3% |
| 2 | 240 | 4.3% |
| 3 | 240 | 4.3% |
| Other values (4) | 660 |
age
Real number (ℝ)
Missing
| Distinct | 44 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 35 |
| Missing (%) | 5.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 42.912782 |
| Minimum | 21 |
|---|---|
| Maximum | 64 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 21 |
|---|---|
| 5-th percentile | 22 |
| Q1 | 32 |
| median | 44 |
| Q3 | 53 |
| 95-th percentile | 62 |
| Maximum | 64 |
| Range | 43 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 12.513303 |
|---|---|
| Coefficient of variation (CV) | 0.2915985 |
| Kurtosis | -1.1357423 |
| Mean | 42.912782 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | -0.12315185 |
| Sum | 28537 |
| Variance | 156.58274 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 55 | 25 | 3.6% |
| 53 | 24 | 3.4% |
| 48 | 23 | 3.3% |
| 46 | 21 | 3.0% |
| 59 | 20 | 2.9% |
| 45 | 20 | 2.9% |
| 52 | 20 | 2.9% |
| 36 | 19 | 2.7% |
| 56 | 19 | 2.7% |
| 21 | 19 | 2.7% |
| Other values (34) | 455 | |
| (Missing) | 35 | 5.0% |
| Value | Count | Frequency (%) |
| 21 | 19 | |
| 22 | 17 | |
| 23 | 15 | |
| 24 | 9 | |
| 25 | 17 | |
| 26 | 13 | |
| 27 | 13 | |
| 28 | 18 | |
| 29 | 12 | |
| 30 | 8 |
| Value | Count | Frequency (%) |
| 64 | 17 | |
| 63 | 7 | 1.0% |
| 62 | 14 | |
| 61 | 14 | |
| 60 | 13 | |
| 59 | 20 | |
| 58 | 8 | 1.1% |
| 57 | 18 | |
| 56 | 19 | |
| 55 | 25 |
gender
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.6 KiB |
| Male | |
|---|---|
| Female | |
| Other | 25 |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 4.9928571 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Other |
|---|---|
| 2nd row | Female |
| 3rd row | Female |
| 4th row | Female |
| 5th row | Female |
Common Values
| Value | Count | Frequency (%) |
| Male | 340 | |
| Female | 335 | |
| Other | 25 | 3.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| male | 340 | |
| female | 335 | |
| other | 25 | 3.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1035 | |
| a | 675 | |
| l | 675 | |
| M | 340 | 9.7% |
| F | 335 | 9.6% |
| m | 335 | 9.6% |
| O | 25 | 0.7% |
| t | 25 | 0.7% |
| h | 25 | 0.7% |
| r | 25 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3495 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 1035 | |
| a | 675 | |
| l | 675 | |
| M | 340 | 9.7% |
| F | 335 | 9.6% |
| m | 335 | 9.6% |
| O | 25 | 0.7% |
| t | 25 | 0.7% |
| h | 25 | 0.7% |
| r | 25 | 0.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3495 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 1035 | |
| a | 675 | |
| l | 675 | |
| M | 340 | 9.7% |
| F | 335 | 9.6% |
| m | 335 | 9.6% |
| O | 25 | 0.7% |
| t | 25 | 0.7% |
| h | 25 | 0.7% |
| r | 25 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3495 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 1035 | |
| a | 675 | |
| l | 675 | |
| M | 340 | 9.7% |
| F | 335 | 9.6% |
| m | 335 | 9.6% |
| O | 25 | 0.7% |
| t | 25 | 0.7% |
| h | 25 | 0.7% |
| r | 25 | 0.7% |
region
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.6 KiB |
| South | |
|---|---|
| North | |
| West | |
| East |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.5185714 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | North |
|---|---|
| 2nd row | East |
| 3rd row | West |
| 4th row | North |
| 5th row | North |
Common Values
| Value | Count | Frequency (%) |
| South | 184 | |
| North | 179 | |
| West | 176 | |
| East | 161 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| south | 184 | |
| north | 179 | |
| west | 176 | |
| east | 161 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 700 | |
| o | 363 | |
| h | 363 | |
| s | 337 | |
| S | 184 | 5.8% |
| u | 184 | 5.8% |
| N | 179 | 5.7% |
| r | 179 | 5.7% |
| W | 176 | 5.6% |
| e | 176 | 5.6% |
| Other values (2) | 322 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3163 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 700 | |
| o | 363 | |
| h | 363 | |
| s | 337 | |
| S | 184 | 5.8% |
| u | 184 | 5.8% |
| N | 179 | 5.7% |
| r | 179 | 5.7% |
| W | 176 | 5.6% |
| e | 176 | 5.6% |
| Other values (2) | 322 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3163 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 700 | |
| o | 363 | |
| h | 363 | |
| s | 337 | |
| S | 184 | 5.8% |
| u | 184 | 5.8% |
| N | 179 | 5.7% |
| r | 179 | 5.7% |
| W | 176 | 5.6% |
| e | 176 | 5.6% |
| Other values (2) | 322 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3163 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 700 | |
| o | 363 | |
| h | 363 | |
| s | 337 | |
| S | 184 | 5.8% |
| u | 184 | 5.8% |
| N | 179 | 5.7% |
| r | 179 | 5.7% |
| W | 176 | 5.6% |
| e | 176 | 5.6% |
| Other values (2) | 322 |
education_level
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.6 KiB |
| Graduate | |
|---|---|
| Secondary | |
| Post-Graduate | |
| Primary |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 9.1628571 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Graduate |
|---|---|
| 2nd row | Graduate |
| 3rd row | Post-Graduate |
| 4th row | Secondary |
| 5th row | Graduate |
Common Values
| Value | Count | Frequency (%) |
| Graduate | 296 | |
| Secondary | 192 | |
| Post-Graduate | 139 | |
| Primary | 73 | 10.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| graduate | 296 | |
| secondary | 192 | |
| post-graduate | 139 | |
| primary | 73 | 10.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1135 | |
| r | 773 | |
| d | 627 | |
| e | 627 | |
| t | 574 | |
| G | 435 | 6.8% |
| u | 435 | 6.8% |
| o | 331 | 5.2% |
| y | 265 | 4.1% |
| P | 212 | 3.3% |
| Other values (7) | 1000 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6414 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| a | 1135 | |
| r | 773 | |
| d | 627 | |
| e | 627 | |
| t | 574 | |
| G | 435 | 6.8% |
| u | 435 | 6.8% |
| o | 331 | 5.2% |
| y | 265 | 4.1% |
| P | 212 | 3.3% |
| Other values (7) | 1000 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6414 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| a | 1135 | |
| r | 773 | |
| d | 627 | |
| e | 627 | |
| t | 574 | |
| G | 435 | 6.8% |
| u | 435 | 6.8% |
| o | 331 | 5.2% |
| y | 265 | 4.1% |
| P | 212 | 3.3% |
| Other values (7) | 1000 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6414 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| a | 1135 | |
| r | 773 | |
| d | 627 | |
| e | 627 | |
| t | 574 | |
| G | 435 | 6.8% |
| u | 435 | 6.8% |
| o | 331 | 5.2% |
| y | 265 | 4.1% |
| P | 212 | 3.3% |
| Other values (7) | 1000 |
employment_type
Categorical
Missing
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 35 |
| Missing (%) | 5.0% |
| Memory size | 5.6 KiB |
| Salaried | |
|---|---|
| Self-Employed | |
| Unemployed |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 9.6496241 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Salaried |
|---|---|
| 2nd row | Salaried |
| 3rd row | Salaried |
| 4th row | Unemployed |
| 5th row | Salaried |
Common Values
| Value | Count | Frequency (%) |
| Salaried | 403 | |
| Self-Employed | 191 | |
| Unemployed | 71 | 10.1% |
| (Missing) | 35 | 5.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| salaried | 403 | |
| self-employed | 191 | |
| unemployed | 71 | 10.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 927 | |
| l | 856 | |
| a | 806 | |
| d | 665 | |
| S | 594 | |
| r | 403 | |
| i | 403 | |
| m | 262 | 4.1% |
| p | 262 | 4.1% |
| o | 262 | 4.1% |
| Other values (6) | 977 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 6417 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 927 | |
| l | 856 | |
| a | 806 | |
| d | 665 | |
| S | 594 | |
| r | 403 | |
| i | 403 | |
| m | 262 | 4.1% |
| p | 262 | 4.1% |
| o | 262 | 4.1% |
| Other values (6) | 977 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 6417 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 927 | |
| l | 856 | |
| a | 806 | |
| d | 665 | |
| S | 594 | |
| r | 403 | |
| i | 403 | |
| m | 262 | 4.1% |
| p | 262 | 4.1% |
| o | 262 | 4.1% |
| Other values (6) | 977 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 6417 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 927 | |
| l | 856 | |
| a | 806 | |
| d | 665 | |
| S | 594 | |
| r | 403 | |
| i | 403 | |
| m | 262 | 4.1% |
| p | 262 | 4.1% |
| o | 262 | 4.1% |
| Other values (6) | 977 |
annual_income
Real number (ℝ)
Missing
| Distinct | 665 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 35 |
| Missing (%) | 5.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 619136.36 |
| Minimum | 1772.81 |
|---|---|
| Maximum | 2743811.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 1772.81 |
|---|---|
| 5-th percentile | 279508.33 |
| Q1 | 472538.45 |
| median | 595289.57 |
| Q3 | 736665.8 |
| 95-th percentile | 951598.31 |
| Maximum | 2743811.9 |
| Range | 2742039 |
| Interquartile range (IQR) | 264127.35 |
Descriptive statistics
| Standard deviation | 265346.59 |
|---|---|
| Coefficient of variation (CV) | 0.42857537 |
| Kurtosis | 15.921147 |
| Mean | 619136.36 |
| Median Absolute Deviation (MAD) | 133235.12 |
| Skewness | 2.611561 |
| Sum | 4.1172568 × 108 |
| Variance | 7.0408815 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 778076.63 | 1 | 0.1% |
| 528074.07 | 1 | 0.1% |
| 660221.47 | 1 | 0.1% |
| 636766.9 | 1 | 0.1% |
| 1138606.73 | 1 | 0.1% |
| 669960 | 1 | 0.1% |
| 399189.08 | 1 | 0.1% |
| 1742721.45 | 1 | 0.1% |
| 244750.73 | 1 | 0.1% |
| 583880.05 | 1 | 0.1% |
| Other values (655) | 655 | |
| (Missing) | 35 | 5.0% |
| Value | Count | Frequency (%) |
| 1772.81 | 1 | |
| 33568.88 | 1 | |
| 50499.03 | 1 | |
| 72850.45 | 1 | |
| 94087.98 | 1 | |
| 102334.53 | 1 | |
| 150622.13 | 1 | |
| 152353.75 | 1 | |
| 156139.93 | 1 | |
| 158886.79 | 1 |
| Value | Count | Frequency (%) |
| 2743811.85 | 1 | |
| 2538524.7 | 1 | |
| 2274223.14 | 1 | |
| 2210141.58 | 1 | |
| 2131950.72 | 1 | |
| 1858840.53 | 1 | |
| 1742721.45 | 1 | |
| 1620097.08 | 1 | |
| 1385247.54 | 1 | |
| 1329740.61 | 1 |
loan_amount
Real number (ℝ)
Unique
| Distinct | 700 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 310137.89 |
| Minimum | -139417.3 |
|---|---|
| Maximum | 1088749.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 18 |
| Negative (%) | 2.6% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | -139417.3 |
|---|---|
| 5-th percentile | 47566.058 |
| Q1 | 192987.06 |
| median | 310955.33 |
| Q3 | 414900.36 |
| 95-th percentile | 572795.98 |
| Maximum | 1088749.9 |
| Range | 1228167.2 |
| Interquartile range (IQR) | 221913.29 |
Descriptive statistics
| Standard deviation | 164238.02 |
|---|---|
| Coefficient of variation (CV) | 0.52956453 |
| Kurtosis | 1.1643587 |
| Mean | 310137.89 |
| Median Absolute Deviation (MAD) | 113039.02 |
| Skewness | 0.35694103 |
| Sum | 2.1709652 × 108 |
| Variance | 2.6974128 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 308847.63 | 1 | 0.1% |
| 449464.74 | 1 | 0.1% |
| 422325.15 | 1 | 0.1% |
| 292786.72 | 1 | 0.1% |
| 272527.43 | 1 | 0.1% |
| 246499.03 | 1 | 0.1% |
| 477096.14 | 1 | 0.1% |
| 205902.98 | 1 | 0.1% |
| 306783.41 | 1 | 0.1% |
| 307679.69 | 1 | 0.1% |
| Other values (690) | 690 |
| Value | Count | Frequency (%) |
| -139417.3 | 1 | |
| -136048.23 | 1 | |
| -90470.56 | 1 | |
| -79836.91 | 1 | |
| -73990.62 | 1 | |
| -73171.3 | 1 | |
| -47421.58 | 1 | |
| -32522.83 | 1 | |
| -28488.26 | 1 | |
| -22857.53 | 1 |
| Value | Count | Frequency (%) |
| 1088749.9 | 1 | |
| 1078332.72 | 1 | |
| 958428.86 | 1 | |
| 814382.94 | 1 | |
| 786463.95 | 1 | |
| 739963.6 | 1 | |
| 730260.46 | 1 | |
| 676733.48 | 1 | |
| 672778.49 | 1 | |
| 648006.11 | 1 |
loan_purpose
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.6 KiB |
| Car | |
|---|---|
| Business | |
| Other | |
| Home | |
| Education |
Length
| Max length | 9 |
|---|---|
| Median length | 5 |
| Mean length | 5.5428571 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Home |
|---|---|
| 2nd row | Home |
| 3rd row | Other |
| 4th row | Other |
| 5th row | Business |
Common Values
| Value | Count | Frequency (%) |
| Car | 174 | |
| Business | 139 | |
| Other | 138 | |
| Home | 137 | |
| Education | 112 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| car | 174 | |
| business | 139 | |
| other | 138 | |
| home | 137 | |
| education | 112 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 417 | |
| e | 414 | |
| r | 312 | 8.0% |
| a | 286 | 7.4% |
| u | 251 | 6.5% |
| i | 251 | 6.5% |
| n | 251 | 6.5% |
| t | 250 | 6.4% |
| o | 249 | 6.4% |
| C | 174 | 4.5% |
| Other values (8) | 1025 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3880 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| s | 417 | |
| e | 414 | |
| r | 312 | 8.0% |
| a | 286 | 7.4% |
| u | 251 | 6.5% |
| i | 251 | 6.5% |
| n | 251 | 6.5% |
| t | 250 | 6.4% |
| o | 249 | 6.4% |
| C | 174 | 4.5% |
| Other values (8) | 1025 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3880 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| s | 417 | |
| e | 414 | |
| r | 312 | 8.0% |
| a | 286 | 7.4% |
| u | 251 | 6.5% |
| i | 251 | 6.5% |
| n | 251 | 6.5% |
| t | 250 | 6.4% |
| o | 249 | 6.4% |
| C | 174 | 4.5% |
| Other values (8) | 1025 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3880 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| s | 417 | |
| e | 414 | |
| r | 312 | 8.0% |
| a | 286 | 7.4% |
| u | 251 | 6.5% |
| i | 251 | 6.5% |
| n | 251 | 6.5% |
| t | 250 | 6.4% |
| o | 249 | 6.4% |
| C | 174 | 4.5% |
| Other values (8) | 1025 |
credit_score
Real number (ℝ)
Missing
| Distinct | 198 |
|---|---|
| Distinct (%) | 29.8% |
| Missing | 35 |
| Missing (%) | 5.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 676.93684 |
| Minimum | 521 |
|---|---|
| Maximum | 818 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 521 |
|---|---|
| 5-th percentile | 591.4 |
| Q1 | 646 |
| median | 679 |
| Q3 | 713 |
| 95-th percentile | 751.8 |
| Maximum | 818 |
| Range | 297 |
| Interquartile range (IQR) | 67 |
Descriptive statistics
| Standard deviation | 49.646299 |
|---|---|
| Coefficient of variation (CV) | 0.073339633 |
| Kurtosis | 0.081721524 |
| Mean | 676.93684 |
| Median Absolute Deviation (MAD) | 34 |
| Skewness | -0.24805826 |
| Sum | 450163 |
| Variance | 2464.755 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 705 | 12 | 1.7% |
| 656 | 10 | 1.4% |
| 677 | 10 | 1.4% |
| 690 | 9 | 1.3% |
| 695 | 9 | 1.3% |
| 651 | 9 | 1.3% |
| 706 | 8 | 1.1% |
| 657 | 8 | 1.1% |
| 701 | 8 | 1.1% |
| 713 | 8 | 1.1% |
| Other values (188) | 574 | |
| (Missing) | 35 | 5.0% |
| Value | Count | Frequency (%) |
| 521 | 2 | |
| 530 | 1 | |
| 537 | 1 | |
| 538 | 2 | |
| 549 | 1 | |
| 555 | 1 | |
| 559 | 1 | |
| 560 | 1 | |
| 561 | 2 | |
| 564 | 1 |
| Value | Count | Frequency (%) |
| 818 | 1 | |
| 807 | 1 | |
| 799 | 1 | |
| 797 | 2 | |
| 790 | 1 | |
| 788 | 1 | |
| 785 | 2 | |
| 784 | 1 | |
| 782 | 1 | |
| 777 | 2 |
repayment_history
Real number (ℝ)
Zeros
| Distinct | 12 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.68 |
| Minimum | 0 |
|---|---|
| Maximum | 11 |
| Zeros | 63 |
| Zeros (%) | 9.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3 |
| median | 6 |
| Q3 | 9 |
| 95-th percentile | 11 |
| Maximum | 11 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.4769464 |
|---|---|
| Coefficient of variation (CV) | 0.61213844 |
| Kurtosis | -1.1631683 |
| Mean | 5.68 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.091638119 |
| Sum | 3976 |
| Variance | 12.089156 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 6 | 72 | |
| 11 | 67 | |
| 5 | 66 | |
| 7 | 65 | |
| 10 | 64 | |
| 0 | 63 | |
| 1 | 55 | |
| 8 | 54 | |
| 9 | 52 | |
| 3 | 49 | |
| Other values (2) | 93 |
| Value | Count | Frequency (%) |
| 0 | 63 | |
| 1 | 55 | |
| 2 | 46 | |
| 3 | 49 | |
| 4 | 47 | |
| 5 | 66 | |
| 6 | 72 | |
| 7 | 65 | |
| 8 | 54 | |
| 9 | 52 |
| Value | Count | Frequency (%) |
| 11 | 67 | |
| 10 | 64 | |
| 9 | 52 | |
| 8 | 54 | |
| 7 | 65 | |
| 6 | 72 | |
| 5 | 66 | |
| 4 | 47 | |
| 3 | 49 | |
| 2 | 46 |
transaction_count
Real number (ℝ)
| Distinct | 182 |
|---|---|
| Distinct (%) | 26.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 102.98429 |
| Minimum | 10 |
|---|---|
| Maximum | 199 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 22.95 |
| Q1 | 57 |
| median | 100.5 |
| Q3 | 150 |
| 95-th percentile | 189.1 |
| Maximum | 199 |
| Range | 189 |
| Interquartile range (IQR) | 93 |
Descriptive statistics
| Standard deviation | 53.83593 |
|---|---|
| Coefficient of variation (CV) | 0.52275869 |
| Kurtosis | -1.1814277 |
| Mean | 102.98429 |
| Median Absolute Deviation (MAD) | 46.5 |
| Skewness | 0.065344068 |
| Sum | 72089 |
| Variance | 2898.3073 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24 | 10 | 1.4% |
| 114 | 9 | 1.3% |
| 103 | 9 | 1.3% |
| 199 | 8 | 1.1% |
| 23 | 8 | 1.1% |
| 139 | 8 | 1.1% |
| 93 | 7 | 1.0% |
| 32 | 7 | 1.0% |
| 91 | 7 | 1.0% |
| 34 | 7 | 1.0% |
| Other values (172) | 620 |
| Value | Count | Frequency (%) |
| 10 | 6 | |
| 11 | 1 | 0.1% |
| 12 | 3 | |
| 13 | 2 | 0.3% |
| 16 | 1 | 0.1% |
| 17 | 5 | |
| 18 | 5 | |
| 19 | 7 | |
| 20 | 1 | 0.1% |
| 21 | 2 | 0.3% |
| Value | Count | Frequency (%) |
| 199 | 8 | |
| 198 | 3 | 0.4% |
| 197 | 6 | |
| 196 | 3 | 0.4% |
| 195 | 5 | |
| 194 | 1 | 0.1% |
| 193 | 2 | 0.3% |
| 192 | 2 | 0.3% |
| 191 | 5 | |
| 189 | 2 | 0.3% |
spending_ratio
Real number (ℝ)
| Distinct | 668 |
|---|---|
| Distinct (%) | 95.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51.3125 |
| Minimum | 10.41 |
|---|---|
| Maximum | 89.86 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 5.6 KiB |
Quantile statistics
| Minimum | 10.41 |
|---|---|
| 5-th percentile | 14.0495 |
| Q1 | 31.515 |
| median | 52.35 |
| Q3 | 71.7325 |
| 95-th percentile | 86.732 |
| Maximum | 89.86 |
| Range | 79.45 |
| Interquartile range (IQR) | 40.2175 |
Descriptive statistics
| Standard deviation | 23.370036 |
|---|---|
| Coefficient of variation (CV) | 0.45544528 |
| Kurtosis | -1.233163 |
| Mean | 51.3125 |
| Median Absolute Deviation (MAD) | 20.095 |
| Skewness | -0.10474862 |
| Sum | 35918.75 |
| Variance | 546.15858 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 68.74 | 3 | 0.4% |
| 83.16 | 2 | 0.3% |
| 60.44 | 2 | 0.3% |
| 70.29 | 2 | 0.3% |
| 44.53 | 2 | 0.3% |
| 80.7 | 2 | 0.3% |
| 62.25 | 2 | 0.3% |
| 17.52 | 2 | 0.3% |
| 33.84 | 2 | 0.3% |
| 86.95 | 2 | 0.3% |
| Other values (658) | 679 |
| Value | Count | Frequency (%) |
| 10.41 | 1 | |
| 10.43 | 1 | |
| 10.44 | 1 | |
| 10.52 | 1 | |
| 10.64 | 1 | |
| 10.72 | 1 | |
| 10.86 | 1 | |
| 10.91 | 1 | |
| 11.08 | 1 | |
| 11.1 | 1 |
| Value | Count | Frequency (%) |
| 89.86 | 1 | |
| 89.81 | 1 | |
| 89.7 | 1 | |
| 89.69 | 1 | |
| 89.57 | 1 | |
| 89.42 | 1 | |
| 89.41 | 1 | |
| 89.18 | 1 | |
| 88.91 | 1 | |
| 88.79 | 1 |
join_date
Date
Unique
| Distinct | 700 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.6 KiB |
| Minimum | 2015-01-05 02:23:02 |
|---|---|
| Maximum | 2023-12-28 10:57:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 563 | |
| 1 | 137 | 19.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 563 | |
| 1 | 137 | 19.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 563 | |
| 1 | 137 | 19.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 700 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 563 | |
| 1 | 137 | 19.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 700 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 563 | |
| 1 | 137 | 19.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 700 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 563 | |
| 1 | 137 | 19.6% |
Interactions
Correlations
| age | annual_income | credit_score | default_flag | education_level | employment_type | gender | loan_amount | loan_purpose | region | repayment_history | spending_ratio | transaction_count | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| age | 1.000 | -0.020 | 0.029 | 0.064 | 0.000 | 0.077 | 0.052 | -0.071 | 0.064 | 0.000 | 0.007 | 0.091 | 0.018 |
| annual_income | -0.020 | 1.000 | -0.048 | 0.000 | 0.039 | 0.000 | 0.000 | 0.034 | 0.042 | 0.044 | -0.003 | -0.029 | 0.009 |
| credit_score | 0.029 | -0.048 | 1.000 | 0.000 | 0.048 | 0.047 | 0.000 | 0.087 | 0.028 | 0.101 | 0.016 | -0.086 | 0.022 |
| default_flag | 0.064 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.109 | 0.000 | 0.082 | 0.000 | 0.000 |
| education_level | 0.000 | 0.039 | 0.048 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| employment_type | 0.077 | 0.000 | 0.047 | 0.000 | 0.000 | 1.000 | 0.020 | 0.000 | 0.077 | 0.000 | 0.041 | 0.000 | 0.059 |
| gender | 0.052 | 0.000 | 0.000 | 0.000 | 0.000 | 0.020 | 1.000 | 0.029 | 0.000 | 0.000 | 0.042 | 0.000 | 0.000 |
| loan_amount | -0.071 | 0.034 | 0.087 | 0.000 | 0.000 | 0.000 | 0.029 | 1.000 | 0.000 | 0.000 | 0.014 | -0.062 | 0.031 |
| loan_purpose | 0.064 | 0.042 | 0.028 | 0.109 | 0.000 | 0.077 | 0.000 | 0.000 | 1.000 | 0.000 | 0.014 | 0.000 | 0.000 |
| region | 0.000 | 0.044 | 0.101 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.000 | 0.000 | 0.000 |
| repayment_history | 0.007 | -0.003 | 0.016 | 0.082 | 0.000 | 0.041 | 0.042 | 0.014 | 0.014 | 0.000 | 1.000 | 0.001 | -0.013 |
| spending_ratio | 0.091 | -0.029 | -0.086 | 0.000 | 0.000 | 0.000 | 0.000 | -0.062 | 0.000 | 0.000 | 0.001 | 1.000 | -0.001 |
| transaction_count | 0.018 | 0.009 | 0.022 | 0.000 | 0.000 | 0.059 | 0.000 | 0.031 | 0.000 | 0.000 | -0.013 | -0.001 | 1.000 |
Missing values
Sample
| customer_id | age | gender | region | education_level | employment_type | annual_income | loan_amount | loan_purpose | credit_score | repayment_history | transaction_count | spending_ratio | join_date | default_flag | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | CUST1000 | 59.0 | Other | North | Graduate | Salaried | 778076.63 | 308847.63 | Home | 603.0 | 0 | 179 | 55.69 | 2023-06-27 07:33:00 | 0 |
| 1 | CUST1001 | 49.0 | Female | East | Graduate | Salaried | 715041.00 | 367030.88 | Home | 672.0 | 10 | 48 | 10.72 | 2017-06-24 15:17:32 | 1 |
| 2 | CUST1002 | 35.0 | Female | West | Post-Graduate | Salaried | 700133.14 | 248617.62 | Other | 656.0 | 1 | 148 | 30.62 | 2022-11-05 08:27:53 | 0 |
| 3 | CUST1003 | 63.0 | Female | North | Secondary | NaN | 609954.74 | 325569.57 | Other | NaN | 5 | 45 | 42.55 | 2016-01-07 10:41:09 | 0 |
| 4 | CUST1004 | 28.0 | Female | North | Graduate | Unemployed | 601412.63 | 155590.12 | Business | 671.0 | 5 | 61 | 46.81 | 2019-03-13 07:12:07 | 1 |
| 5 | CUST1005 | 41.0 | Male | West | Graduate | Salaried | 467935.77 | 269008.31 | Education | 705.0 | 2 | 40 | 33.81 | 2020-03-18 16:40:03 | 1 |
| 6 | CUST1006 | 59.0 | Male | West | Graduate | Salaried | 739765.68 | 391532.43 | Home | 680.0 | 11 | 42 | 12.64 | 2015-07-28 23:58:16 | 0 |
| 7 | CUST1007 | 39.0 | Male | South | Graduate | Salaried | 684194.59 | 323545.94 | Other | 695.0 | 3 | 150 | 57.21 | 2016-10-30 20:31:39 | 0 |
| 8 | CUST1008 | 43.0 | Female | East | Graduate | Self-Employed | 698403.77 | 212020.31 | Home | 771.0 | 11 | 155 | 51.93 | 2016-10-08 17:47:39 | 1 |
| 9 | CUST1009 | 31.0 | Female | West | Post-Graduate | Salaried | 494793.03 | 333632.46 | Other | 663.0 | 1 | 178 | 32.64 | 2020-01-05 02:53:34 | 0 |
| customer_id | age | gender | region | education_level | employment_type | annual_income | loan_amount | loan_purpose | credit_score | repayment_history | transaction_count | spending_ratio | join_date | default_flag | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 690 | CUST1690 | 61.0 | Male | East | Graduate | Self-Employed | 226051.66 | 206694.78 | Other | 656.0 | 2 | 77 | 61.84 | 2018-12-13 17:52:40 | 1 |
| 691 | CUST1691 | 55.0 | Female | East | Graduate | Self-Employed | 522295.84 | 387410.64 | Other | 639.0 | 9 | 151 | 37.10 | 2020-11-03 21:10:51 | 0 |
| 692 | CUST1692 | 21.0 | Male | West | Post-Graduate | Salaried | 638084.78 | 374531.04 | Education | 646.0 | 0 | 126 | 47.60 | 2020-08-10 21:50:16 | 0 |
| 693 | CUST1693 | 41.0 | Female | East | Graduate | Salaried | 689843.74 | 460465.72 | Other | 612.0 | 11 | 115 | 78.63 | 2016-05-15 05:19:37 | 0 |
| 694 | CUST1694 | 26.0 | Male | West | Graduate | Self-Employed | 498038.66 | 120045.37 | Car | 676.0 | 3 | 102 | 33.88 | 2022-06-10 03:04:28 | 0 |
| 695 | CUST1695 | 48.0 | Male | East | Graduate | Salaried | 606888.20 | -47421.58 | Business | 596.0 | 9 | 74 | 83.90 | 2023-09-22 21:06:53 | 0 |
| 696 | CUST1696 | 37.0 | Other | South | Graduate | Self-Employed | 102334.53 | 428702.13 | Business | 690.0 | 11 | 77 | 28.38 | 2016-07-17 03:17:52 | 0 |
| 697 | CUST1697 | NaN | Male | North | Primary | Unemployed | 468350.32 | 175770.56 | Business | NaN | 5 | 137 | 69.96 | 2023-11-09 18:15:09 | 0 |
| 698 | CUST1698 | 51.0 | Female | South | Secondary | Salaried | 690701.74 | -2773.18 | Business | 687.0 | 10 | 72 | 60.44 | 2023-02-17 02:30:07 | 0 |
| 699 | CUST1699 | 25.0 | Male | North | Secondary | Salaried | 403541.19 | 321759.56 | Education | 654.0 | 11 | 122 | 69.97 | 2019-11-08 23:54:55 | 1 |